le723z/Rearank-7B

Tags: text-generation, text-generation-inference


This is a reasoning reranking agent built on Qwen2.5-7B for the paper "REARANK: Reasoning Re-ranking Agent via Reinforcement Learning". The model is trained with GRPO on a reranking dataset built from only 179 queries. The codebase is available at https://github.com/lezhang7/Rearank
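
GRPO (Group Relative Policy Optimization) scores each sampled output relative to the other outputs sampled for the same prompt, rather than against a learned value function. The sketch below shows only that group-normalized advantage step in plain Python. It is not taken from the Rearank codebase: the function name `group_relative_advantages`, the example reward values, the `eps` term, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of outputs sampled for the same prompt.

    Hypothetical helper: each advantage is (reward - group mean) / group std.
    eps guards against division by zero when all rewards in the group are equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption, some trainers use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical ranking-quality rewards for four sampled rerankings of one query.
rewards = [0.2, 0.5, 0.5, 0.8]
advantages = group_relative_advantages(rewards)

for r, a in zip(rewards, advantages):
    print(f"reward={r:.2f} advantage={a:+.3f}")
```

Outputs that score above the group mean get a positive advantage and are reinforced, while those below the mean are penalized. When every output in a group earns the same reward, all advantages are zero and that group contributes no learning signal.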

Downloads last month: 95

Safetensors
Model size: 7.62B params
Tensor type: BF16


Model tree for le723z/Rearank-7B

Base model: Qwen/Qwen2.5-7B
Finetuned: Qwen/Qwen2.5-7B-Instruct
Finetuned (2327): this model